Re-mining Topics Popular in the Recent Past from a Large-Scale Closed Caption TV Corpus

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

"Draw My Topics": Find Desired Topics fast from large scale of Corpus

We develop the “Draw My-Topics” Toolkit, which provides a fast way to incorporate social scientists’ concerns and interests into the standard topic model. Instead of using raw corpus with primitive processing as input, an algorithm based on Vector Space Model and Conditional Entropy are used to connect social scientists’ subjective want and the unsupervised topic models’ output. Space for users...

متن کامل

Large Scale Corpus Analysis and Recent Applications

Recent progress of corpus and machine learning-based natural language processing methodologies have made it possible to handle large scale corpus with a quite high accuracy. The speaker is now involved in a project for constructing a large scale contemporary Japanese balanced corpus, aiming at constructing automatic annotation tools on various levels of natural language analyses. I will first i...

متن کامل

Mining Large-scale TV Group Viewing Patterns for Group Recommendation

We present a large-scale study of television viewing habits, focusing on how individuals adapt their preferences when consuming content in group settings. While there has been a great deal of recent work on modeling individual preferences , there has been considerably less work studying the behavior and preferences of groups, due mostly to the difficulty of data collection in these settings. In...

متن کامل

Project for Production of Closed-Caption TV Programs for the Hearing Impaired

We describe an on-going project whose primary aim is to establish the technology of producing closed captions for TV news programs efficiently using natural language processing and speech recognition techniques for the benefit of the hearing impaired in Japan. The project is supported by the Telecommunications Advancement Organisation of Japan with the help of the ministry of Posts and Telecomm...

متن کامل

STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset

In recent years, automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention. In this paper, we particularly consider generating Japanese captions for images. Since most available caption datasets have been constructed for English language, there are few datasets for Japanese. To tackle this problem, we construct a large-scale Japane...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Future Computer and Communication

سال: 2015

ISSN: 2010-3751

DOI: 10.7763/ijfcc.2015.v4.364